MovieLens data sets were collected by the GroupLens Research Project at the University of Minnesota.
This data set consists of: * 100,000 ratings (1-5) from 943 users on 1682 movies. * Each user has rated at least 20 movies. * Simple demographic info for the users (age, gender, occupation, zip)
Let’s load the dataset
## user item rating timestamp
## Min. : 1.0 Min. : 1.0 Min. :1.00 Min. :1997-09-20
## 1st Qu.:254.0 1st Qu.: 175.0 1st Qu.:3.00 1st Qu.:1997-11-13
## Median :447.0 Median : 322.0 Median :4.00 Median :1997-12-22
## Mean :462.5 Mean : 425.5 Mean :3.53 Mean :1997-12-30
## 3rd Qu.:682.0 3rd Qu.: 631.0 3rd Qu.:4.00 3rd Qu.:1998-02-23
## Max. :943.0 Max. :1682.0 Max. :5.00 Max. :1998-04-22
#head(ratings)
datatable(ratings, filter = "top", rownames = FALSE, options = list(pageLength = 5, scrollX=T))## Warning in instance$preRenderHook(instance): It seems your data is too
## big for client-side DataTables. You may consider server-side processing:
## https://rstudio.github.io/DT/server.html
A work by Yu